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SYSTEM AND METHOD FOR CHECKING A CONTENT 

SITE FOR EFFICACY 

BACKGROUND OF THE INVENTION 
The present invention deals with generating 
5 content, accessible over a network such as a web. 
More specifically, the present invention deals with 
verifying the effectiveness of web content so that 
the chances of a web site being presented first by a 
search engine in response to a keyword search is 
10 increased. 

In order for a business, or content 
provider, to have network information available and 
searchable by a network search engine, the business 
or content provider generally submits its content for 

15 indexing by the search engine. The indexing process 
is conventional and well known. 

Conventional search engines use a tool 
referred to as a spider, or crawler. The crawler 
accesses sites on a computer network (which may be a 

2 0 global computer network such as the Internet or World 
Wide Web) and generates lists of words that are found 
on those sites. The crawler also follows each link 
on the site it is currently crawling. Based on the 
words and links, the web crawler creates an index of 

25 the words associated with the uniform resource 
locator (URL) of the site on which the crawler found 
the words . 

When the search engine is used by a user 
attempting to locate information on the network, the 
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user typically types in one or more keywords that 
form the basis of a search. The search engine then 
searches its index based on the keywords entered by 
the user and returns a list of web sites related to 
5 those keywords. By performing certain commonly known 
indexing and analysis techniques, the conventional 
search engine will generally rank order the list of 
web sites based on how closely they are believed to 
be related to the keywords entered by the user. 

10 Of course, the content provider or business 

typically wants its web site to be listed first in 
results returned by the search engine when relevant 
keywords are entered. There have been some attempts 
to arrange content on web pages in such a way as to 

15 optimize the web pages for searching (i.e., to 
increase the chance that the content provider's web 
site will be returned in a relatively high position 
in the rank ordered search results) . 

SUMMARY OF THE INVENTION 

2 0 The present invention provides a system and 

method for automatically suggesting optimizations 
that can be made to content pages to increase the 
chances that a network site containing the content 
page will be indexed and returned high in the rank 

25 ordered list of results from a search engine. In one 
embodiment, the present invention also includes a 
keyword generation tool for use in generating 
effective keywords for which a content page can be 
optimized. 
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In accordance with another embodiment, the 
present invention uses hierarchical rules that apply 
in determining the effectiveness of a web site. The 
hierarchical rules can be configured to apply 
5 differently based on how important the keyword is to 
a network site. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram of one 
illustrative embodiment of an environment in which 
10 the present invention can be used. 

FIG. 2 is a block diagram of one 
illustrative embodiment of a network content 
processing system in accordance with the present 
invention. 

15 FIG. 3 is a flow diagram illustrating the 

operations of the system shown in FIG. 2, in 
accordance with one illustrative embodiment of the 
present invention . 

FIG. 4 is a flow diagram illustrating the 

2 0 operation of a keyword selection tool in accordance 
with one embodiment of the present invention. 

FIGS. 5A-5F are screen shots further 
illustrating the operation of a keyword selection 
tool in accordance with one embodiment of the present 

25 invention. 

FIG. 6 is a screen shot illustrating an 
overview report generated by the system shown in FIG. 
2 , in accordance with one embodiment of the present 
invention. 
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FIGS. 7A-7C are screen shots illustrating a 
broken links report generated by the system shown in 
FIG. 2, in accordance with one embodiment of the 
present invention. 
5 FIGS. 8A-8B are screen shots illustrating 

an incoming links report generated by the system 
shown in FIG. 2, in accordance with one embodiment of 
the present invention. 

FIG. 9 is a screen shot illustrating a link 

10 download time report generated by the system shown in 
FIG. 2, in accordance with one embodiment of the 
present invention. 

FIGS. 10A-10B are screen shots illustrating 
a readiness check generated by the system shown in 

15 FIG. 2, in accordance with one embodiment of the 
present invention. 

Appendix A is one illustrative list of 
messages that indicate rules applied in checking 
content pages for readiness. 

2 0 DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

The present invention deals with generating 
content pages that will be accessible through a 
search engine over a computer network. More 
specifically, the present invention deals with a 

25 system that checks to determine whether content pages 
are configured in a proper way to increase the 
chances that they will be indexed and returned by a 
search engine in response to a keyword search. The 
present invention can be used to examine content in a 

30 network environment or in a standalone environment. 



However, before describing the present invention in 
greater detail, one illustrative embodiment of an 
environment in which the present invention can be 
used is discussed. 

FIG. 1 illustrates an example of a suitable 
computing system environment 100 on which the 
invention may be implemented. The computing system 
environment 100 is only one example of a suitable 
computing environment and is not intended to suggest 
any limitation as to the scope of use or 
functionality of the invention. Neither should the 
computing environment 100 be interpreted as having 
any dependency or requirement relating to any one or 
combination of components illustrated in the 
exemplary operating environment 100. 

The invention is operational with numerous 
other general purpose or special purpose computing 
system environments or configurations. Examples of 
well known computing systems, environments, and/or 
configurations that may be suitable for use with the 
invention include, but are not limited to, personal 
computers, server computers, hand-held or laptop 
devices , multiprocessor systems , microprocessor-based 
systems, set top boxes, programmable consumer 
electronics, network PCs, minicomputers, mainframe 
computers, distributed computing environments that 
include any of the above systems or devices, and the 
like . 

The invention may be described in the 
general context of computer-executable instructions, 
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such as program modules, being executed by a 
computer. Generally, program modules include 

routines, programs, objects, components, data 
structures, etc. that perform particular tasks or 
5 implement particular abstract data types. The 
invention may also be practiced in distributed 
computing environments where tasks are performed by 
remote processing devices that are linked through a 
communications network. In a distributed computing 

10 environment, program modules may be located in both 
local and remote computer storage media including 
memory storage devices . 

With reference to FIG. 1, an exemplary 
system for implementing the invention includes a 

15 general purpose computing device in the form of a 
computer 110. Components of computer 110 may 

include, but are not limited to, a processing unit 
120, a system memory 130, and a system bus 121 that 
couples various system components including the 

20 system memory to the processing unit 120. The system 
bus 121 may be any of several types of bus structures 
including a memory bus or memory controller, a 
peripheral bus, and a local bus using any of a 
variety of bus architectures. By way of example, and 

25 not limitation, such architectures include Industry 
Standard Architecture (ISA) bus, Micro Channel 
Architecture (MCA) bus, Enhanced ISA (EISA) bus, 
Video Electronics Standards Association (VESA) local 
bus, and Peripheral Component Interconnect (PCI) bus 

30 also known as Mezzanine bus. 
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Computer 110 typically includes a variety 
of computer readable media. Computer readable media 
can be any available media that can be accessed by 
computer 110 and includes both volatile and 
5 nonvolatile media, removable and non-removable media. 
By way of example, and not limitation, computer 
readable media may comprise computer storage media 
and communication media. Computer storage media 
includes both volatile and nonvolatile, removable and 

10 non-removable media implemented in any method or 
technology for storage of information such as 
computer readable instructions, data structures, 
program modules or other data. Computer storage 
media includes, but is not limited to, RAM, ROM, 

15 EEPROM, flash memory or other memory technology, CD- 
ROM, digital versatile disks (DVD) or other optical 
disk storage, magnetic cassettes, magnetic tape, 
magnetic disk storage or other magnetic storage 
devices, or any other medium which can be used to 

20 store the desired information and which can be 
accessed by computer 100. Communication media 

typically embodies computer readable instructions, 
data structures, program modules or other data in a 
modulated data signal such as a carrier WAV or other 

25 transport mechanism and includes any information 
delivery media. The term "modulated data signal" 
means a signal that has one or more of its 
characteristics set or changed in such a manner as to 
encode information in the signal. By way of example, 

30 and not limitation, communication media includes 
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wired media such as a wired network or direct -wired 
connection, and wireless media such as acoustic, FR, 
infrared and other wireless media. Combinations of 
any of the above should also be included within the 
5 scope of computer readable media. 

The system memory 130 includes computer 
storage media in the form of volatile and/or 
nonvolatile memory such as read only memory (ROM) 131 
and random access memory (RAM) 132. A basic 

10 input/output system 133 (BIOS) , containing the basic 
routines that help to transfer information between 
elements within computer 110, such as during start- 
up, is typically stored in ROM 131. RAM 132 
typically contains data and/or program modules that 

15 are immediately accessible to and/or presently being 
operated on by processing unit 12 0. By way o 
example, and not limitation, FIG. 1 illustrates 
operating system 134, application programs 135, other 
program modules 136, and program data 137. 

20 The computer 110 may also include other 

removable/non- removable volatile/nonvolatile computer 
storage media. By way of example only, FIG. 1 
illustrates a hard disk drive 141 that reads from or 
writes to non- removable, nonvolatile magnetic media, 

25 a magnetic disk drive 151 that reads from or writes 
to a removable, nonvolatile magnetic disk 152, and an 
optical disk drive 155 that reads from or writes to a 
removable, nonvolatile optical disk 156 such as a CD 
ROM or other optical media. Other removable/ non - 

30 removable, volatile/nonvolatile computer storage 
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media that can be used in the exemplary operating 
environment include, but are not limited to, magnetic 
tape cassettes, flash memory cards, digital versatile 
disks, digital video tape, solid state RAM, solid 
5 state ROM, and the like. The hard disk drive 141 is 
typically connected to the system bus 121 through a 
non-removable memory interface such as interface 140, 
and magnetic disk drive 151 and optical disk drive 
155 are typically connected to the system bus 121 by 

10 a removable memory interface, such as interface 150. 

The drives and their associated computer 
storage media discussed above and illustrated in FIG. 
1, provide storage of computer readable instructions, 
data structures, program modules and other data for 

15 the computer 110. In FIG. 1, for example, hard disk 
drive 141 is illustrated as storing operating system 
144, application programs 145, other program modules 
14 6, and program data 14 7. Note that these 

components can either be the same as or different 

20 from operating system 134, application programs 135, 
other program modules 136, and program data 137. 
Operating system 144, application programs 145, other 
program modules 14 6, and program data 14 7 are given 
different numbers here to illustrate that, at a 

25 minimum, they are different copies. 

A user may enter commands and information 
into the computer 110 through input devices such as a 
keyboard 162, a microphone 163, and a pointing device. 
161, such as a mouse, trackball or touch pad. Other 

30 input devices (not shown) may include a joystick, 



game pad, satellite dish, scanner, or the like. 
These and other input devices are often connected to 
the processing unit 120 through a user input 
interface 160 that is coupled to the system bus, but 
5 may be connected by other interface and bus 
structures, such as a parallel port, game port or a 
universal serial bus (USB) . A monitor 191 or other 
type of display device is also connected to the 
system bus 121 via an interface, such as a video 

10 interface 190. In addition to the monitor, computers 
may also include other peripheral output devices such 
as speakers 197 and printer 196, which may be 
connected through an output peripheral interface 190. 

The computer 110 may operate in a networked 

15 environment using logical connections to one or more 
remote computers, such as a remote computer 180. The 
remote computer 180 may be a personal computer, a 
hand-held device, a server, a router, a network PC, a 
peer device or other common network node, and 

2 0 typically includes many or all of the elements 
described above relative to the computer 110. The 
logical connections depicted in FIG. 1 include a 
local area network (LAN) 171 and a wide area network 
(WAN) 173, but may also include other networks. Such 

25 networking environments are commonplace in offices, 
enterprise -wide computer networks, intranets and the 
Internet . 

When used in a LAN networking environment, 
the computer 110 is connected to the LAN 171 through 
30 a network interface or adapter 170. When used in a 
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WAN networking environment, the computer 110 
typically includes a modem 172 or other means for 
establishing communications over the WAN 173, such as 
the Internet. The modem 172, which may be internal 
5 or external, may be connected to the system bus 121 
via the user- input interface 160, or other 
appropriate mechanism. In a networked environment, 
program modules depicted relative to the computer 
110, or portions thereof, may be stored in the remote 

10 memory storage device. By way of example, and not 
limitation, FIG. 1 illustrates remote application 
programs 185 as residing on remote computer 180. It 
will be appreciated that the network connections 
shown are exemplary and other means of establishing a 

15 communications link between the computers may be 
used. 

It will be understood that the present 
discussion may proceed with respect to a global 
computer network (such as the Internet or World Wide 

20 Web) . However, the present invention is not so 
limited but could be used on any searchable network, 
and the discussion herein is exemplary only. 

FIG. 2 is a block diagram of a web content 
processing system 2 00 in accordance with one 

25 embodiment of the present invention. System 2 00 
includes crawler and readiness checking component 
2 02, keyword generator 2 04, user interface 2 06, and 
rule store 208. System 200 is also shown connected 
to a content store 210 and to one or more search 

30 engines 212. 
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In one illustrative embodiment, system 200 
is configured to crawl through the entire site 
represented by content store 210 based on a keyword 
phrase entered by the user. Then user is shown all 
5 pages that are ready for submission to a search 
engine for indexing. The user can select pages for 
optimization as well. In optimizing a page, system 
200 is configured to access web pages or content 
pages 214 in content store 210 and determine whether 

10 they are written and laid out in a manner which is 
likely to increase the possibility that they will be 
returned at a relatively high position in the rank 
ordered list of web sites returned by conventional 
search engines in response to user queries. 

15 This operation of system 200 is illustrated 

by the flow diagram shown in FIG. 3. First, crawler 
and readiness checker component 202 (component 202) 
receives keywords based on a user input. This is 
illustrated by block 250 in FIG. 3. The keywords can 

20 simply be manually entered by a user through user 
interface 206. Alternatively, the user can invoke 
keyword generator 2 04 which automatically generates 
possible keywords for selection by the user. The 
operation of keyword generator 2 04 is discussed in 

25 greater detail below, with respect to FIGS. 4 and 5A- 
5F. Suffice it to say, for the present discussion 
that component 2 02 receives keywords. 

Once the keywords are received, component 
202 accesses rules in rule store 208. This is 

30 indicated by block 252 in FIG. 3. The rules are used 
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by component 2 02, (some in conjunction with the 
keywords entered) in scrutinizing the content pages 
214 in content store 210 to determine whether the 
content in content store 210 is written and laid out 
5 in an efficient manner for ready indexing and return 
by a conventional search engine. Some of the rules 
are described in greater detail below. However, for 
the sake of example, the rules can include such 
things as whether a keyword is found within a title 

10 tag on pages 214, whether meta tags exist for the 
keywords, whether the uniform resource locator (URL) 
redirects the crawler to an unreachable URL, whether 
the URL is formatted properly, etc. 

The crawler in component 2 02 crawls through 

15 the content and formatting on the pages 214 in 
content store 210, applying the rules from rule store 
208 to determine whether the content or formatting 
complies with, or violates, any of the rules being 
applied. Crawling the content pages and applying the 

20 rules is indicated by block 254 in FIG. 3. 

Component 2 02 then outputs a report to the 
user, again illustratively through user interface 
206. This is indicated by block 256 in FIG. 3. The 
reports can take a wide variety of different forms, 

25 but generally indicate how effective the content 
pages 214 on content store 210 will be in achieving 
indexing and a high ranking in the rank ordered list 
of web sites returned by search engines when 
searching based on queries input by a user. A number 
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of illustrative reports will be described below with 
respect to FIGS. 6-10B. 

FIG. 4 is a flow diagram illustrating the 
operation of keyword generator 204 in greater detail. 
5 The flow diagram of FIG. 4 will be discussed in 
conjunction with the screen shots illustrated in 
FIGS. 5A-5F. 

In order to determine whether keyword 
generator 204 will be invoked, component 202 first 

10 receives from the user through user interface 2 06, a 
selection as to the mode by which keywords will be 
input. One embodiment of such a screen shot is 
illustrated in FIG. 5A. It can be seen that the user 
can simply make a selection indicating that the user 

15 wishes to input her or his own keywords, or that the 
user wishes to use the keyword generator tool (or 
keyword research tool) 204. If the user wishes to 
enter keywords manually, a suitable screen is simply 
presented such that the user can enter the desired 

20 keywords. However, for the sake of example, it is 
assumed that the user wishes to invoke keyword 
generator 2 04, and that selection is shown on FIG. 
5A. Selection of the mode by which keywords are 
input is indicated by block 300 in FIG. 4. 

25 When the user has selected the mode 

indicating that keyword generator 2 04 is to be used, 
component 2 02 then receives from the user, through 
user interface 206, one or more root keywords which 
the user desires to initiate the process of keyword 

3 0 selection with. These root keywords are 
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illustratively words that describe what the user's 
content page to be analyzed is about. One 
illustrative screen shot for receiving the root 
keywords from the user is shown in FIG. 5B. Receiving 
5 the keyword roots from the user is illustrated by 
block 302 in FIG. 4 . 

Some search engines offer information that 
can be used to identify alternative keywords. For 
instance, such search engines track the keywords used 

10 by an individual user in a given search process. 
These search engines can be queried for this 
information to locate alternative keywords. An 
initial keyword is input and the search engine 
returns additional words used by other users who also 

15 used the initial keyword in conducting a search. 

Thus, keyword generator 2 04 accesses one or 
more search engines 212 to obtain a list of 
alternative keywords that could be used by the user 
in describing the content of the content store 210. 

20 Invoking the keyword generator to identify additional 
possible keywords is illustrated by block 304 in FIG. 
4. One illustrative screen shot of a returned set of 
alternative keywords is illustrated in FIG. 5C. 

Component 2 02 then requests the user to 

25 select all of the returned keywords which are 
applicable to, or related to, the content of the 
user's content page to be checked. In doing so, the 
user can simply select the relevant keywords on the 
screen shot shown in FIG. 5C. Receiving the keyword 
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selection by the user is indicated by block 306 in 
FIG. 4. 

Component 202 then performs statistical 
analysis on the selected keywords in order to 
5 determine which are most effective as search terms in 
uniquely identifying the content page. This can be 
done in a wide variety of ways. However, in one 
illustrative embodiment, component 2 02 invokes 
information from the records kept by search engines 

10 212 to determine how many searches were run using 
each of the keywords selected, and also how many 
search engine results are returned based on the 
search using that keyword. 

For instance, if a search term is used a 

15 very large number of times, and there are only a very 
few result listings returned for that search term, 
then it is determined that the search term will be 
quite highly effective in uniquely identifying the 
content page and obtaining a high ranking in the rank 

20 ordered search results. However, if a search term is 
not used by many searchers (i.e., if not many 
searches are performed using that term) but the 
number of search results returned using that term is 
relatively high, then the search term will be less 

25 effective in obtaining a high rank in a rank ordered 
list of search results. One embodiment of the 
statistical processing uses a ratio of these numbers. 
Based on this statistical processing, component 202 
returns to the user through user interface 206 a rank 

30 ordered list of keywords. One screen shot 
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illustrating such a rank ordered list is shown in 
FIG. 5D, and presenting that list is illustrated by 
block 308 in FIG. 4. 

As the screen shot in FIG. 5D illustrates, 
5 component 2 02 allows the user to select up to a 
predetermined number of the displayed keywords for 
use in analyzing its content page. In the embodiment 
illustrated in FIG. 5D, the user is allowed to choose 
up to three words. Receiving a user selection of 
10 this keyword subset is illustrated by block 310 is 
FIG. 4. 

Component 2 02 then displays that subset of 
words to the user and requests that the user select 
one of those keywords as the primary keyword. This 

15 is illustrated by block 310 in FIG. 4. One 
illustrative screen shot which allows the user to 
select the primary keyword is shown in FIG. 5E. 

Once the keywords are selected and the 
primary keyword is identified, component 2 02 has 

20 sufficient information to perform a readiness check 
on the specified web page 214 in content store 210. 
FIG. 5F is one illustrative screen shot illustrating 
this . 

As discussed with respect to FIG. 3, 
25 component 202 then accesses rules in rule store 208 
and applies those rules to the content page 214 being 
examined in content store 210. The rules may 
illustratively be hierarchically selected. In other 
words, some of the rules may be more strictly applied 
30 when examining the content page 214 using the primary 
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keyword, than when examining the content page 214 
using the remaining keywords. Similarly, more rules 
may be applied when examining the content page 214 
with respect to the primary keyword than with respect 
5 to the other keywords. In any case, crawler 2 02 
examines the content of a given web page 214 in 
content store 210 applying the rules from rule store 
208. The particular rules applied can take a wide 
variety of different forms, and can be modified based 

10 on empirical data. One illustrative embodiment of 
errors identified by applying the rules is set out in 
appendix A. Of course, it will be noted that this 
list of errors is illustrative only. 

After examining all of the pages 214 in 

15 content store 210, component 202 provides a report to 
the user through user interface 206. Of course, the 
report can take a wide variety of different forms, 
but a number of different illustrative embodiments of 
such reports are illustrated in FIGS. 6-10B. 

20 FIG. 6 illustrates an overview report for 

the entire site that contains web pages 214 based on 
the initial crawl through the entire site. In one 
illustrative embodiment, each of the pages 214 are 
examined separately, in a separate operation, for 

25 optimization using selected keywords. However, the 
overall web site containing those pages 214 is the 
subject of the overview report shown in FIG. 6. It 
can be seen that the embodiment of the overview 
report in FIG. 6 gives such information as the number 

30 of pages analyzed, the number of pages ready to 
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submit (for which no changes are suggested) the 
number of pages needing work (for which changes and 
I optimizations are suggested) , the average download 

time, the number of links to the site under 
i 5 examination, the number of broken links (which when 

followed did not lead to a viable site) and the total 
number of mouse clicks to the submitted pages (which 
is illustratively shown for a page only after the 
page is submitted) . 

10 When the user clicks each of those items 

shown in FIG. 6, additional detailed information is 
shown. For example, FIG. 7A illustrates one 

illustrative embodiment of a screen shot that appears 
when a user clicks on the "broken links" information 

15 item in FIG. 6. The complete list of broken links 
can be shown, or it can be abbreviated. 

The user can then select one of the broken 
links shown in FIG. 7A and select the "view link 
details' 7 button. In that case, additional 

20 information will be displayed with respect to that 
link, such as the information shown in the 
illustrative screen shot set out in FIG. 7B. That 
information includes such things as the 
identification of the broken link and an error code 

25 associated with that link. By selecting the error 
code, additional information relating to the 
displayed error code will be provided, such as shown 
in the screen shot illustrated by FIG. 7C. Of 
course, the broken links information can be provided 

30 in a number of different forms and that shown in 
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FIGS. 7A-7C is but one illustrative way to present 
the information. 

The reports provided by component 2 02 can 
also include a report of incoming links or those web 
5 pages which have links to the present web site under 
consideration. One illustrative screen shot for 
showing this information is shown in FIG. 8A. If the 
user elects the "view" input in FIG. 8A, the web 
sites which contain links to the web site under 
10 consideration are shown. One illustrative screen 
shot for showing this information is shown in FIG. 
8B. 

The reports output by component 2 02 can 
also include a download time report. Such a report 

15 can include such information as how long it takes the 
page to load. One illustrative screenshot for 
showing this information is set out in FIG. 9. 

Component 202 will also illustratively 
output a readiness check report. Such a report will 

2 0 illustratively be provided for each page 214 of the 
web site under consideration. The readiness check 
report will include information that indicates how 
effectively the page will be used by search engines. 
In other words, the information will give the user an 

25 indication as to how likely it is that any of the 
user's web pages 214 will be ranked high in the list 
of search results returned by a search engine using 
the keywords selected. 

In one illustrative embodiment, component 

30 202 not only outputs a report indicating problems 
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with an associated web page, but also outputs 
suggested actions which can be taken to remedy or 
reduce the problems. FIGS. 10A and 10B are screen 
shots illustrating one embodiment of such a readiness 
5 report. It can be seen in FIG. 10A that component 
2 02 flags problems associated with such things as the 
page setup, and the searchability of the page with 
respect to the primary keywords and the other 
keywords selected by the user. Recall that the rules 

10 applied when scrutinizing the information on a 
content page 214 may differ based on whether the key 
word being used to scrutinize the page is a primary 
or secondary keyword. 

In any case, the boxes associated with each 

15 of the areas of scrutinization shown in FIG. 10A can 
illustratively be provided with a marker indicative 
of whether those issues turned out to be a problem. 
For instance, it can be seen that both "URL issues" 
and "spam" issues have a check mark adjacent them. 

20 This indicates that the site has passed the check for 
those particular items. However, the "page issues" 
item has an exclamation point next to it. This 
indicates that the check found a minor problem with 
the URL for that particular item. Another indicator, 

25 such as an "x" can be used to indicate that the URL 
has a serious problem with a particular item, which 
could greatly affect the success of the URLs 
submission to a search engine. 

By clicking on any of the issues listed in 

30 FIG. 10A, the user is shown to an explanation of the 
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issues found, and illustratively a suggestion as to 
how to address the problem. A number of these 
descriptions and suggestions are shown at the bottom 
half of FIG. 10A and in FIG. 10B. For example, the 
5 "page issues" are described as problems detected with 
the setup of the HTML code, or page in general, that 
could effect the ability to obtain listing in a 
search engine. Then, the specific problems found are 
discussed. One such problem shown in FIG. 10B is 

10 that the page does not appear to have a description 
meta tag within the HTML code. The description then 
goes on to suggest a fix for that problem, and even 
provide the correct format for such a tag. 
Additional examples of issues which are found by the 

15 analysis performed by component 202 and reported to 
the user are shown in FIG. 10B. Again, of course, 
this information can be provided to the user in a 
number of different ways and that shown in FIGS. 10A 
and 10B is illustrative only. Also, additional or 

20 different issues can be the subject of the scrutiny 
and analysis in component 202, and those listed in 
FIG. 10A are illustrative only. 

It can thus be seen that the present 
invention provides a component which can be used by a 

2 5 network content provider to select keywords to be 
identified in the content. The present invention can 
also be used to scrutinize a content provider's web 
pages to determine how effective they will be when 
subjected to searches by conventional search engines. 

30 Similarly, the present invention can be used to 
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identify problems that may arise in attempting to get 
a web site or web pages listed at, and indexed by, 
search engines . 

Although the present invention has been 
5 described with reference to particular embodiments, 
workers skilled in the art will recognize that 
changes may be made in form and detail without 
departing from the spirit and scope of the invention. 



